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Coding a data stream 



The invention relates to coding and decoding of a data stream. 

The invention further relates to transmission and reception of a data stream. 

Reference is made to the article of M. Budagavi, W. Rabiner Heinzelman, J. 
Webb, R. Talluri, "Wireless MPEG-4 Video Communication on DSP Chips", IEEE Signal 
Processing Magazine, January 2000. This article discloses that, to make the compressed bit- 
stream more robust, the MPEG-4 video compression standard incorporates several error 
resilience tools in its simple profile to enable detection, containment, and concealment of 
errors. These are powerful source-coding techniques for combating bit errors when they 
occur at rates less than 10" 3 ; however, present-day wireless channels can have much higher 
bit error rates (BERs). The harsh conditions on mobile wireless channels result from 
multipath fading due to motion between the transmitter and the receiver, and changes in the 
surrounding terrain. Multipath fading manifests itself in the form of long bursts of errors. 
Hence, some form of interleaving and channel coding is required to improve the channel 
conditions. Using a combination of source and channel coding, it is possible to achieve 
acceptable visual quality over error-prone wireless channels with MPEG-4 simple-profile 
video compression. The structure of an MPEG-4 compressed bit-stream also lends itself to 
using unequal error protection, a form of joint source-channel coding, to ensure fewer errors 
in the important portions of the bitstream. 

An object of the invention is to provide an improved error protection of a data 
stream. To this end, the invention provides coding, decoding, transmission, reception, a 
coded data stream and a storage medium as defined in the independent claims. Advantageous 
embodiments are defined in the dependent claims. 

The invention is based on the insight that in a coding scheme like MPEG-4, 
packets are not exactly of the same length and partitions have different lengths in different 
packets, due to the variable length coding used and to the requirement of having an integer 
number of macro-blocks in each packet. This implies that a fixed UEP scheme cannot be 
used and, in order to perform decoding with the correct code rate, the bit-stream structure 
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should be known at the receiver, at channel decoding level. Packets, like partitions, are not of 
the same length; thus the UEP scheme should be dynamically changed for each packet and 
the knowledge of the partition length is required. This problem is addressed by including in 
the data stream, information about the lengths of the partitions that are or have been 
5 protected, e.g. in the form of a length field. Such a length field may be added in each packet 
after the resync marker. At the receiver side, the length information is read. UEP channel 
decoding may then be performed with the knowledge of the length of each partition. 

Preferably, a specific, strong error protection is chosen for the length field, as 
the information it contains is crucial for the subsequent decoding. 
1 0 Preferably, the length field may comprise the lengths of the packet partitions 

after channel encoding (i.e. the lengths of the channel-encoded partitions). This is 
advantageous because these are the lengths of the packet partitions furnished to the channel 
decoder. 

Alternatively, the length field may contain the lengths of the packet partitions 
15 before channel coding. An advantage of including the lengths before channel encoding is that 
these lengths are shorter than the lengths after channel encoding, and can thus be more 
efficiently represented. Combining the lengths of the partitions before channel coding with 
the error protection rates results in the lengths of the channel-encoded partitions which are 
used in the channel decoder. 
20 In practical embodiments, after the length information has been read, the 

length field is deleted from the bit-stream, i.e. it is not inserted in the bit-stream fed into a 
source decoder (e.g. an MPEG-4 decoder). This modification is therefore transparent to the 
source decoder. 

The invention is especially advantageous in the field of wireless transmission 
25 of MPEG-4 video. 

The aforementioned and other aspects of the invention will be apparent from 
and elucidated with reference to the embodiments described hereinafter. 

In the drawings: 

30 Fig. 1 shows data partitioning in the MPEG-4 bit-stream; 

Fig. 2 shows a protection scheme in relation to an embodiment of the 

invention; 

Fig. 3 shows unequal error protection and length field insertion according to 
an embodiment of the invention; 
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Fig. 4 shows unequal error protection and length field insertion combined with 

start code substitution according to an embodiment of the invention; 

Fig. 5 shows a transmitter according to an embodiment of the invention, the 

transmitter comprising means for length field insertion; 
5 Fig. 6 shows a receiver according to an embodiment of the invention, the 

receiver comprising means for length field reading; 

Fig. 7 shows a transmitter according to an embodiment of the invention, the 

transmitter comprising means for length field insertion, and means for start code detection 

and substitution; and 

1 0 Fig. 8 shows a receiver according to an embodiment of the invention, the 

receiver comprising means for length field reading, and means for substituted start code 
detection and replacement. 

Due to compression and in particular to the use of predictive coding and 
Variable Length Coding (VLC), an MPEG-4 bit-stream is very sensitive to errors. The article 

15 of R. Talluri, "Error-resilient video coding in the ISO MPEG-4 standard", IEEE 

Communication Magazine, vol. 36, no.6, June 1998 describes error resilience aspects of the 
video coding techniques that are standardized in the ISO MPEG-4 standard. The specific 
tools adopted into the ISO MPEG-4 standard to enable the communication of compressed 
video data over noisy wireless channels are presented in detail. These techniques include 

20 ^synchronization strategies, data partitioning, reversible Variable Length Codes, and header 
extension codes. 

These tools help adding robustness to the MPEG-4 bit-stream. With the use of 
Resync markers, the MPEG-4 bit-stream results composed of packets, which are of almost 
the same length. Regardless of such tools, achievable received quality is still poor when 

25 MPEG-4 is transmitted over a wireless channel. Error resilience tools can, however, produce 
a further improvement of the received video quality if exploited at channel coding level. In 
particular, the data partitioning tool can be usefully exploited with the purpose of performing 
Unequal Error Protection (UEP): information bits contained in each packet are separated in 
three partitions, each of which has a different sensitivity to channel errors. As shown in Fig. 1 

30 for I frames, partitions consist of a header HI, DC DCT coefficients and AC DCT 

coefficients separated by a DC marker DCM. As far as P frames are concerned, partitions 
consist of a header HP, and a motion partition m and a texture partition tp separated by a 
motion marker mm. 
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In the following, a suitable technique taking into account the characteristics of 
both the wireless channel and of the application is described in relation to an embodiment of 
the invention. Specifically, information about the different sensitivity of source bits to 
channel errors should be exploited through UEP. This technique consists in performing error 
5 protection according to the perceived sensitivity of source bits to errors: more sensitive bits 
are protected with a higher protection (corresponding to a lower rate code), for less important 
bits a lower protection (i.e. a higher rate code) is used. Compared to classical Forward Error 
Correction (FEC), UEP allows achieving a higher perceived video quality given the same bit- 
rate, through the exploitation of the characteristics of the source. 

10 In the proposed scheme, the three partitions are protected with different code 

rates, according to the subjective importance of the relevant information. Information 
contained in headers is crucial for the successive decoding of the packet, thus those should be 
strongly protected. For intra frames, DC coefficients have a higher subjective importance 
than AC coefficients; thus the DC coefficients should be higher protected than the AC 

1 5 coefficients. As far as predicted frames are concerned, motion data should be more protected 
than texture data, as if motion information is correctly received texture information may be 
partially reconstructed. 

The UEP implementation proposed takes also into account the different 
importance of different types of frames: in the MPEG-4 standard. Intra, Predicted and 

20 Backward predicted frames are considered, where Intra frames are coded independently from 
the others and Predicted frames exploit information from contiguous frames. 

A correct reception of Intra frames is crucial to perform motion compensation 
of the subsequent Predicted frames, thus a lower average channel coding rate (i.e. a higher 
protection) could be associated to Intra frames, while Predicted frames can be coded with a 

25 higher average rate (i.e. a lower protection). Fig. 2 shows the described protection scheme. 

An MPEG-4 coded bit- stream is structured in Video Objects (VO), Video 
Object Layers (VOL), Groups of Video Object Planes (GOV), Video Object Planes (VOP), 
and Packets. In order to allow synchronization, a start of each part of the bit-stream is 
indicated by a relevant start code. Start codes are unique words, recognizable from any legal 

30 sequence of variable length coded words. In Fig. 2, HI indicates the start code for the VO, 
H2 the start code for the VOL, H3 the start code for the GOV, H4 the start code for the VOP 
and H5 the packet start code (resync marker). 

UEP may be performed through Rate Compatible Punctured Convolutional 
(RCPC) codes, with rates chosen according to a perceived importance of bits. In this case the 



PHIT010006 — - 




5 13.07.2001 
codes considered are obtained by puncturing the same "mother" code. Only one coder and 
one decoder are then needed for performing coding and decoding of the whole bit-stream. 
Rate Compatible Punctured Convolutional Codes as such are known from the article of J. 
Hagenauer, "Rate-Compatible Punctured Convolutional Codes (RCPC Codes) and their 
5 Applications", IEEE Trans. Commun., vol.36, no.4, pp. 389-400, April 1988. 

Different average code rates are taken into consideration for the protection of 
different frames (I frames are coded with a higher protection/ lower rate, a lower protection/ 
higher average rate is taken into consideration for P frames), and for each frame the data 
partitioning tool added to the MPEG-4 standard is exploited, in order to provide a stronger 
1 0 protection for the most significant partitions. 

Fig. 3 shows unequal error protection and length field insertion according to 
an embodiment of the invention. Information about the length of the partitions that are or 
have been protected is enclosed in the coded data stream, e.g. in a length field If added in 
each packet after the resync marker H5. A specific, strong error protection is chosen for the 
1 5 length field, as the information it contains is crucial for the subsequent decoding. At the 

receiver side, after the detection of a resync marker, the length information is read (see Fig. 
5). UEP channel decoding may then be performed with the knowledge of the length of each 
partition. 

In this case, if /j, / 2 , h are the lengths of the three partitions before channel 
20 encoding, the length of the coded packet including the length field will be: 

j _ ^length _field jjj U l 3 + M 



~* coded _ packet 



^length _ field ^1 ^2 ^3 

where M is the memory of the code, in the case convolutional codes are 

considered. 

As for the memory M of the code: convolutional codes differ from block codes 
25 in that the encoder contains memory and the encoder outputs at any given time unit not only 
depend on the inputs at that time unit, but also on M previous input blocks, where M is the 
memory of the code. A memory M convolutional encoder consists of an M-stage shift 
register with the output of selected stages being added modulo-2 to form the encoded 
symbols. Since a convolutional coder is a sequential circuit, its operation can be described by 
30 a state diagram. The state of the encoder is defined as its shift register contents; thus an 

encoder may assume 2 M states. In order to protect the last bits of the bit-stream with the same 
strength of the others, M tail bits should be added to the bit-stream in order to force the 
encoder to converge back to a known state (typically the "0" state). In fact, if convolutional 
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codes are considered, the packet is terminated by shifting M "0" bits into the shift register in 
order to allow a proper termination of the trellis. Tail bits are coded with the higher rate. In 
order to compute the total average rate, the average between I frames and P frames should be 
computed and overhead (introduced by e.g.the length fields) should also be considered. 
5 Preferably, the length field comprises the lengths of the packet partitions after 

channel encoding, i.e. — , — / 3 + — , because these are the lengths of the packet partitions 

furnished to the channel decoder. 

After the length information has been read, the length field is deleted from the 
bit-stream, i.e. it is not inserted in the bit-stream fed into the MPEG-4 decoder. As seen for 

1 0 the substitution of the original start codes with the "wireless" ones, also this modification is 
therefore transparent to the MPEG-4 decoder. 

Fig. 4 shows a first transmitter according to an embodiment of the invention. 
A data stream SI is received in a packet buffer 10. The first transmitter further comprises a 
start code detector for detecting resync markers H5 in the data stream SI. Packets of the data 

1 5 stream S 1 , present in between the markers H5, are channel encoded in a channel coder 1 1 to 
obtain channel coded packets. These channel coded packets are furnished to a multiplexer 14 
and are included in a data stream WS1 to be transmitted. The transmitted data stream WS1 is 
furnished to an antenna, e.g. for wireless transmission, or to a storage medium 15. To 
perform length field insertion, the first transmitter comprises a length field insertion unit 20 

20 that furnishes a length field If to the multiplexer 14 which includes the length field in the 

transmitted data stream WS1 (see also Fig. 3). In this embodiment, the length field insertion 
unit 20 is controlled by the channel encoder 1 1 and/or the start code detection unit 12. 

Fig. 5 shows a first receiver for receiving a data stream WS 1 transmitted by an 
embodiment according to Fig. 4. If a resync marker H5 is detected in a start code detector 32, 

25 a packet buffer 30 is initialized and subsequent bits fill the buffer until the next start code is 
detected. When the next start code is detected, the buffer includes one packet. In a channel 
decoder 3 1 , decoding is performed on the bits in the buffer, according to the VOP indicator 
information and to the length information included in the length field If. The length field is 
detected and read in the length field detector 40. The rates used in the scheme are preferably 

30 fixed and the same as used in the channel coder. In the case of variable rates, the rates have to 
be received from the channel encoder in the transmitter. The channel-decoded packets form 
the channel decoded data stream SI which is furnished to a source decoder (not shown), e.g. 
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an MPEG-4 decoder. Note that if RCPC codes are used, de-puncturing is performed before 
decoding. In this case, the packet is then decoded at the mother code rate. 



The length field insertion as described above is advantageously applied in 
combination with start code substitution. At channel coding level, an advantageous 
embodiment according to the invention is proposed, in which length field insertion is 
combined with start codes substitution. Start codes substitution addresses the problem that 
(MPEG-4) start codes are not robust to errors: a single error in a start code may cause missed 
detection, resulting in a loss of synchronization. 

In start code substitution, at least one marker out of a predetermined set of at 
least two mutually different markers, the marker indicating a start of a given part of the data 
stream, is represented in the outputted data stream by a higher-robustness word having a 
higher robustness to channel errors than the at least one marker. The higher-robustness word 
may be a higher-robustness word with higher correlation properties than the respective 
marker, and is preferably a pseudo-noise word. Using higher-robustness words with higher 
correlation properties to represent markers makes transmission of these markers more robust 
against transmission errors. In a receiver, a given higher-robustness word is preferably 
detected by correlating the received data stream with higher-robustness words obtained from 
a predetermined set of higher-robustness words. If the correlation of the received data stream 
with a given higher-robustness word out of the predetermined set yields a value above a 
given threshold, the given higher-robustness word is decoded to obtain a corresponding 
marker at the position of the higher-robustness word. The higher-robustness word is 
preferably substituted by the corresponding 'original 5 marker. This has the advantage that 
'original' / unaffected markers are present in the MPEG-4 data stream at the receiver after 
channel decoding. This embodiment of the invention therefore provides advantageous error 
protection by transparent substitution of start codes with higher-robustness words. 

Preferably, the data packets in the data stream are coded according to a 
channel coding mechanism different from spread-spectrum coding. 

Advantageously, at the transmitter side, respective markers are substituted 
with respective higher-robustness words obtained from a predetermined set of higher- 
robustness words, each higher-robustness word in the set of higher-robustness words 
representing a given marker in the predetermined set of markers. By substituting the markers 
by a corresponding higher-robustness word, fast and advantageous coding is provided. The 
higher-robustness words can fast and easily be obtained from a look-up table. Coding errors 
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that could be obtained when the markers are coded with a pseudo-noise sequence impressed 
on the marker are avoided. 

Although substitution of the markers with respective new higher-robustness 
words obtained from a predetermined set of higher-robustness words is advantageous, the 
higher-robustness words with higher correlation properties may alternatively be obtained by 
impressing a fixed pseudo-noise sequence on the markers at a modulator. In this 
embodiment, it is possible in the decoder to obtain the original markers by removing the 
fixed pseudo-random sequence from the higher-robustness words at a demodulator. 

John G. Proakis, 'Digital communications', 2 nd edition, McGraw-Hill, 1989, 
pp. 801-817 discloses spread-spectrum signals for digital communications. Spread-spectrum 
signals used for the transmission of digital information are distinguished by the characteristic 
that their bandwidth W is much greater than the information rate R in bits per second. That is, 
the bandwidth expansion factor B e = W / R for a spread-spectrum signal is much greater than 
unity. The large redundancy inherent in spread-spectrum signals is required to overcome the 
severe levels of interference that are encountered in the transmission of digital information 
over some radio and satellite channels. Proakis discloses a spread-spectrum digital 
communications system with a binary information sequence at its input at the transmitting 
end and at its output at the receiving end. The channel encoder and decoder and the 
modulator and demodulator are basic elements. In addition to these elements, two identical 
pseudo-random pattern generators, one which interfaces with the modulator at the 
transmitting end and the second which interfaces with the demodulator at the receiving end. 
The generators generate a pseudo-random or pseudo-noise (PN) binary- valued sequence, 
which is impressed on the transmitted signal at the modulator and removed from the received 
signal at the demodulator. Synchronization of the PN sequence generated at the receiver with 
the PN sequence contained in the incoming received signal is required in order to demodulate 
the received signal. Initially, prior to the transmission of information, synchronization may be 
achieved by transmitting a fixed pseudo-random bit pattern, which the receiver will recognize 
in the presence of interference with a high probability. After time synchronization of the 
generator is established, the transmission of information may commence. Generation of PN 
sequences is further explained on pages 831-836. 

In the following, advantageous embodiments are decribed for the simplified 
case of VOP's coincident with frames. 
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In the proposed scheme, start codes in a data stream S2 are substituted (see 



Fig. 6) with pseudo-noise words WH1 . . . WHS, which are sequences with high correlation 
properties (e.g. Gold sequences). These new start codes are denoted by Wireless Start Codes. 
In particular, a substitution is performed for VO, VOL, VOP, GOV start codes and for the 
5 Resync marker. The data stream in Fig. 6 does not include the GOV start code (H3), 

considering the MPEG-4 bit-stream. In the MPEG-4 bit-stream there is no GOV start code 
(H3) after the VOL start code (H2), because the VOL start code (H2) also indicates the 
beginning of a GOV. 



10 estimated through correlation before the channel decoding process; a trade-off should be 
achieved between the probability of missing a start code and the probability of start codes 
emulation, thus the choice of the wireless start codes length and of a proper threshold for the 
correlation is performed accordingly. As the detection is performed, wireless start codes 
WH1...WH5 are substituted with the corresponding start codes H1...H5 from an original set 

15 of start codes. The described substitution is herewith transparent to the source decoder. 



invention, similar to the transmitter of Fig. 5, but arranged to further perform start codes 
substitution. In the second transmitter the start code detector 12 is arranged to detect the 
20 codes HI ...H5 in a data stream S2. A detected start code is substituted by a corresponding 
pseudo-noise word WH1...WH5 by a pseudo-noise word generator 13. The pseudo-noise 
word is furnished to the multiplexer 14 that includes the pseudo-noise word in the data 
stream WS2 to be transmitted. 



25 embodiment similar to Fig. 7. In the start codes detector 32 (e.g. a pseudo-noise word 

detector), correlation evaluations are performed between each allowed pseudo-noise word 
(i.e. from the predetermined set of pseudo-noise words, corresponding to the markers) and 
the relevant bit-stream portion in order to detect pseudo-noise words representing start codes. 
Correlations are compared with corresponding thresholds th. When a pseudo-noise word is 

30 detected, the bit indicator in the bit-stream shifts the proper number of bits and the 

corresponding MPEG- 4 start code H1...H5 is provided by start code generator 33, which 
start code is inserted in a multiplexer 34 whose task is to arrange a bit-stream S'to be fed to 
the MPEG-4 decoder. If either a GOV start code or a VOP start code is detected, a VOP 
indicator changes its status. 



At the receiver side, the position of these wireless start codes WH1...WH5 are 



Fig. 7 shows a second transmitter according to an embodiment of the 



Fig. 8 shows a receiver for receiving a data stream WS2 transmitted by an 
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Preferably, no correlation evaluation is performed until the buffer contains N 
bits, where N is the minimum length of a packet. 

Although not shown in Figs. 4-5, 7-8, the data stream may be modulated 
before transmission by a modulator in the transmitter and consequently be demodulated in 
5 the receiver by a demodulator before decoding is performed. 

In the length field, instead of in absolute length values, the lengths of the 
respective packet partitions may also be given in percentages of the packet length. 

In the case the respective lengths of the packet partitions remain constant over 
a multiple of (subsequent) packets, it suffices to include length information in one of these 
10 packets, e.g. a first packet. It is further possible to use differential lengths in a given length 
field, the differential lengths indicating differences between the current respective packet 
partition lengths and previous respective packet partition lengths. 

Because some packet partitions are usually shorter than other packet partitions, 
the number of bits reserved in the length field for indicating the lengths of these usually 
1 5 shorter packet partitions is preferably less than the number of bits reserved for indicating the 
lengths of longer packet partitions. For example, the header of a packet is usually shorter than 
the other packet partitions. Therefore, the number of bits reserved in the length field to 
indicate the length of a header is preferably smaller than for other, longer packet partitions. 

It should be noted that the above-mentioned embodiments illustrate rather than 
20 limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as limiting the claim. The 
word 'comprising' does not exclude the presence of other elements or steps than those listed 
in a claim. The invention can be implemented by means of hardware comprising several 
25 distinct elements, and by means of a suitably programmed computer. In a device claim 

enumerating several means, several of these means can be embodied by one and the same 
item of hardware. The mere fact that certain measures are recited in mutually different 
dependent claims does not indicate that a combination of these measures cannot be used to 
advantage. 



